CPU and GPU real-time filtering methods for dense surface metrology using general matrix to matrix multiplications

نویسندگان

چکیده

Abstract Filtering is a required task in surface metrology for the identification of components relevant automated quality control. The calculation real-time features about crucial to determining mechanical and physical properties inspected product. computation efficiency filtering operations major challenge metrology, as current sensors provide massive volumes data at very high acquisition rates. To overcome challenges, this work presents different solutions comparing performance on CPU GPU, using modern hardware. proposed framework focused techniques that can be expressed finite impulse response (FIR) kernel includes Gaussian kernel, most common technique recommended by ISO ASME standards. This research proposes variations double FIFO circular filters. filters are transformed into series general matrix multiplications, which run extremely efficiently architectures. approach provides superior compared with previous works. Additionally, tests carried out quantify GPU terms transfer capabilities order diminish penalty imposed from main memory operations. Based results, an efficient batch faster than even small profile sizes, offloading host optimal system application response.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

"Wide or tall" and "sparse matrix dense matrix" multiplications

This note explores sparse matrix dense matrix (SMDM) multiplications, useful in block Krylov or block Lanczos methods. SMDM computations are AU , and V A, multiplication of a large sparse matrix m × n matrix A by a matrix V of k rows of length m or a matrix U of k columns of length k, k << m, k << n . In a block Lanczos or Krylov algorithm, matrix matrix multiplications with the ”tall” U and ”w...

متن کامل

Implementing Matrix Multiplications on the Multi-Core CPU Architectures

Recent commercial microprocessors are concentrating on the multi-core CPU architectures, while most parallel and/or distributed computing methods focus on the multi-CPU architectures. Therefore, there are needs to analyze and adapt traditional parallel algorithms for the new multi-core environments. In this paper, we use matrix multiplications as the target problem, and implemented it using var...

متن کامل

Systolic arrays for matrix multiplications

متن کامل

Sparse-matrix vector multiplication on hybrid CPU+GPU platform

Sparse-matrix vector multiplication(Spmv) is a basic operation in many linear algebra kernels.So it is interesting to have a spmv on modern architectures like GPU. As it is a irregular computation CPU also performs compares to GPU. So it is interesting to have this routine in hybrid architectures like CPU+GPU.So we have designed a hybrid algorithm for Spmv which uses a CPU and a GPU. We have ex...

متن کامل

Heterogeneous Sparse Matrix Computations on Hybrid GPU/CPU Platforms

Hybrid GPU/CPU clusters are becoming very popular in the scientific computing community, as attested by the number of such systems present in the Top 500 list. In this paper, we address one of the key algorithms for scientific applications: the computation of sparse matrix-vector products that lies at the heart of iterative solvers for sparse linear systems. We detail how design patterns for sp...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Real-time Image Processing

سال: 2022

ISSN: ['1861-8219', '1861-8200']

DOI: https://doi.org/10.1007/s11554-022-01204-4